Abstract. For a given subspace, the q-Rayleigh-Ritz method projects the
large quadratic eigenvalue problem (QEP) onto it and produces a small sized 
dense QEP. Similar to the Rayleigh-Ritz method for the linear eigenvalue prob- 
lem, the q-Rayleigh-Ritz method defines the q-Ritz values and the q-Ritz vec- 
tors of the QEP with respect to the projection subspace. We analyze the 
convergence of the method when the angle between the subspace and the de- 
sired eigenvector converges to zero. We prove that there is a q-Ritz value 
that converges to the desired eigenvalue unconditionally but the q-Ritz vector 
converges conditionally and may fail to converge. To remedy the drawback 
of possible non-convergence of the q-Ritz vector, we propose a refined q-Ritz 
vector that is mathematically different from the q-Ritz vector and is proved 
to converge unconditionally. We construct examples to illustrate our theory. 



1. Introduction 

Consider the numerical solution of the large quadratic eigenvalue problem (QEP)

(1.1) $Q(\lambda)x = (\lambda^2 M + \lambda D + K)x = 0,$

where $\lambda \in \mathbb{C}$, $x \in \mathbb{C}^n \setminus \{0\}$, and $M$, $D$ and $K$ are $n \times n$
complex matrices with $M = M^H > 0$, i.e., $M$ is Hermitian positive definite. The
scalar $\lambda$ and the nonzero vector $x$ in (1.1) are called an eigenvalue and a
corresponding eigenvector of the quadratic pencil $Q(\lambda)$ or $(M, D, K)$,
respectively. The pair $(\lambda, x)$ is called an eigenpair of $(M, D, K)$. Since
$M = M^H > 0$ in (1.1), $Q(\lambda)$ has $2n$ finite eigenvalues.

QEPs arise in a wide variety of scientific and engineering applications [2, 27].
The theoretical framework for general matrix polynomials and in particular for 
quadratic pencils can be found in books by Lancaster [18] and more recently by
Gohberg, Lancaster and Rodman [5]. A good survey of mathematical properties,
perturbation analysis, and a variety of numerical algorithms for QEPs can be found
in the paper by Tisseur and Meerbergen.

In practice, a small number of eigenvalues that are nearest to a target $\tau$ or lie
in a prescribed region of the complex plane, together with the corresponding
eigenvectors, are often of interest. To this end, we exploit the shift
transformation $\lambda_\tau = \lambda - \tau$ with $\det(Q(\tau)) \neq 0$ to transform (1.1) to a new
QEP of the form

(1.2) $Q_\tau(\lambda_\tau)x = (\lambda_\tau^2 M_\tau + \lambda_\tau D_\tau + K_\tau)x = 0,$

where $M_\tau = M$, $D_\tau = 2\tau M + D$ and $K_\tau = \tau^2 M + \tau D + K$ is nonsingular.
Throughout the paper, we assume that the eigenvalues to be sought are nonzero.
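
The coefficient formulas in (1.2) can be checked numerically. The following is a minimal sketch, assuming NumPy is available; the helper names `shift_qep` and `qep_matrix` and the random test matrices are ours and serve only as an illustration.

```python
import numpy as np

def shift_qep(M, D, K, tau):
    """Coefficients (M_tau, D_tau, K_tau) of the shifted pencil in (1.2)."""
    M_tau = M
    D_tau = 2.0 * tau * M + D
    K_tau = tau**2 * M + tau * D + K
    return M_tau, D_tau, K_tau

def qep_matrix(M, D, K, lam):
    """Evaluate the quadratic pencil lam^2 M + lam D + K."""
    return lam**2 * M + lam * D + K

# Hypothetical test data: M symmetric positive definite, D and K arbitrary.
rng = np.random.default_rng(0)
n = 4
G = rng.standard_normal((n, n))
M = G @ G.T + n * np.eye(n)
D = rng.standard_normal((n, n))
K = rng.standard_normal((n, n))
tau, lam = 0.7 + 0.2j, -1.3 + 0.5j

M_tau, D_tau, K_tau = shift_qep(M, D, K, tau)
# Q(lam) and Q_tau(lam - tau) should agree up to rounding errors.
shift_error = np.linalg.norm(
    qep_matrix(M, D, K, lam) - qep_matrix(M_tau, D_tau, K_tau, lam - tau)
)
```

Since the two pencils coincide as matrix functions, an eigenpair $(\lambda, x)$ of (1.1) corresponds to the eigenpair $(\lambda - \tau, x)$ of (1.2).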

One class of classical methods for solving the QEP (1.1) is to reformulate it as a
certain standard (or generalized) eigenvalue problem via a so-called linearization 
process and then to apply Krylov subspace based methods or Jacobi-Davidson 
type methods to solve the corresponding linear eigenvalue problem. Most of these 
methods fall into the category of the Rayleigh-Ritz method that is widely used for 
the computation of eigenpairs of a linear eigenvalue problem from a given projection 
subspace. As is well known, under the assumption that the angle between a desired 
eigenvector and the projection subspace tends to zero, there exists a Ritz value 
that converges to the desired eigenvalue unconditionally but its corresponding Ritz 
vector may fail to converge; furthermore, when one is concerned with eigenvectors, 
one can compute certain refined Ritz vectors whose convergence is guaranteed |101 
HainillllllS]; see also [25]. 

Over the years, some reliable numerical methods have been proposed that are 
used to solve large and sparse QEPs directly. Based on the Ritz-Galerkin condition, 
various methods are designed to construct suitable lower dimensional subspaces. 
Then, the large QEP is projected onto a given subspace to produce a small sized 
dense QEP which can be solved by the standard QR or QZ algorithm. They fall 
into the category of the q-Rayleigh-Ritz method, as will be described in the next 
paragraph. Methods of this type include the residual iteration method [SI [211 [22] , 
the Jacobi-Davidson method (131 [23] , Krylov type subspace methods [5] [TO] , the 
nonlinear Arnoldi method [15] , second-order Arnoldi (SOAR) type methods [TJ [T7J 
[20] [29], the iterated shift-and-invert Arnoldi method and the semiorthogonal
generalized Arnoldi (SGA) method [7]. 

Now we describe the q-Rayleigh-Ritz method for the QEP. For a given orthonormal
matrix $Q \in \mathbb{C}^{n \times m}$ with $m < n$, the q-Rayleigh-Ritz method is to find a scalar
$\mu \in \mathbb{C}$ and a unit length vector $\hat{x} \in \mathbb{C}^m$ satisfying the Ritz-Galerkin condition

$(\mu^2 MQ + \mu DQ + KQ)\hat{x} \perp \mathrm{span}\{Q\},$

which amounts to solving the projected QEP

(1.3) $(\mu^2 \widehat{M} + \mu \widehat{D} + \widehat{K})\hat{x} = 0,$

where

(1.4) $\widehat{M} = Q^H M Q, \quad \widehat{D} = Q^H D Q, \quad \widehat{K} = Q^H K Q.$

If $(\mu, \hat{x})$ with $\|\hat{x}\| = 1$ is an eigenpair of $(\widehat{M}, \widehat{D}, \widehat{K})$, i.e.,
$(\mu^2 \widehat{M} + \mu \widehat{D} + \widehat{K})\hat{x} = 0$, then $\mu$ and $Q\hat{x}$ are, respectively, called a q-Ritz
value and a corresponding q-Ritz vector of $(M, D, K)$ with respect to $\mathrm{span}\{Q\}$,
and $(\mu, Q\hat{x})$ is a q-Ritz pair of $(M, D, K)$. Since $M$ is Hermitian positive
definite, so is $\widehat{M}$ for any given $Q$. Therefore, we have $2m$ finite q-Ritz values.
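
The procedure above can be sketched in a few lines. The following is a minimal illustration, assuming NumPy is available; the function name `q_rayleigh_ritz`, the use of a companion matrix with `numpy.linalg.eig` in place of a QR or QZ based polynomial eigensolver, and the random test data are our own choices, not part of the method's definition.

```python
import numpy as np

def q_rayleigh_ritz(M, D, K, Q):
    """q-Ritz values and q-Ritz vectors of (M, D, K) with respect to span{Q}.

    Q is assumed to have orthonormal columns and M to be Hermitian positive
    definite, so that the projected leading coefficient is nonsingular.
    """
    QH = Q.conj().T
    Mh, Dh, Kh = QH @ M @ Q, QH @ D @ Q, QH @ K @ Q   # projected matrices (1.4)
    m = Q.shape[1]
    # Companion matrix whose eigenvectors have the block form [mu * x; x].
    C = np.block([[-np.linalg.solve(Mh, Dh), -np.linalg.solve(Mh, Kh)],
                  [np.eye(m), np.zeros((m, m))]])
    mu, V = np.linalg.eig(C)
    Xh = V[m:, :]                                     # lower blocks x
    Xh = Xh / np.linalg.norm(Xh, axis=0)              # unit length eigenvectors
    return mu, Q @ Xh                                 # q-Ritz values, q-Ritz vectors

# Hypothetical test problem.
rng = np.random.default_rng(1)
n, m = 8, 3
G = rng.standard_normal((n, n))
M = G @ G.T + n * np.eye(n)
D = rng.standard_normal((n, n))
K = rng.standard_normal((n, n))
Q, _ = np.linalg.qr(rng.standard_normal((n, m)))

mu, X = q_rayleigh_ritz(M, D, K, Q)
# Ritz-Galerkin condition: each residual is orthogonal to span{Q}.
galerkin = max(
    np.linalg.norm(Q.conj().T @ ((mu[i]**2 * M + mu[i] * D + K) @ X[:, i]))
    for i in range(2 * m)
)
```

The sketch returns all $2m$ q-Ritz pairs; in practice one would select the pair whose q-Ritz value is nearest to the target.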

In this paper we study the convergence of the q-Ritz value and the corresponding
q-Ritz vector, and extend some of the results in [15, 16, 25] to the
q-Rayleigh-Ritz method. Although a number of q-Rayleigh-Ritz procedures with
respect to different subspaces have been used, to the best of our knowledge, there
has been no unified convergence result or general theory. As will be seen, carrying
out this task is mathematically nontrivial. Similar to the linear eigenvalue
problem, it turns out that there exists a q-Ritz value that converges to the
desired eigenvalue unconditionally but the corresponding q-Ritz vector may fail to
converge even if the corresponding projection subspace $\mathrm{span}\{Q\}$ contains a
sufficiently accurate approximation to the desired eigenvector. It is thus
necessary to replace the q-Ritz vector by a refined q-Ritz vector that minimizes a
residual norm and is mathematically different from the q-Ritz vector. We prove
that the refined q-Ritz vector converges unconditionally provided that the angles
between the desired eigenvector and the subspaces tend to zero. All convergence
results are nontrivial generalizations of the known results on the Rayleigh-Ritz
method and its refinement for the linear eigenvalue problem in [15, 16, 25].

This paper is organized as follows. In Section 2 we analyze the convergence of
q-Ritz values and q-Ritz vectors and prove that the q-Ritz value is unconditionally
convergent but the associated q-Ritz vector may fail to converge. To remedy
this drawback, in Section 3 we introduce a refined q-Ritz vector and prove its
unconditional convergence. Finally, we conclude the paper in Section 4.

Throughout this paper, the superscripts $H$ and $T$ denote the conjugate transpose
and the transpose of a matrix or vector, respectively. $I_n$ is the identity matrix
of dimension $n$. We denote by $\|\cdot\|$ both the Euclidean vector norm and the
spectral matrix norm.



2. Convergence of q-Ritz values and q-Ritz vectors 



Throughout the paper, let $(\lambda_1, x_1)$ with $\|x_1\| = 1$ be a desired eigenpair of
$(M, D, K)$ and assume that $\lambda_1 \neq 0$ is simple.

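We convert the QEP (1.1) to a generalized eigenvalue problem (GEP) of the form

(2.1a) $A \begin{bmatrix} \lambda x \\ x \end{bmatrix} = \lambda B \begin{bmatrix} \lambda x \\ x \end{bmatrix},$

where

(2.1b) $A = \begin{bmatrix} -D & -K \\ I_n & 0 \end{bmatrix}$ and $B = \begin{bmatrix} M & 0 \\ 0 & I_n \end{bmatrix}.$

So $\lambda_1$ is an eigenvalue of $(A, B)$ in (2.1) and
$\xi_1 = \begin{bmatrix} \lambda_1 x_1 \\ x_1 \end{bmatrix} / \sqrt{1 + |\lambda_1|^2}$ is its corresponding normalized
eigenvector. There are unitary matrices $[\xi_1, X]$ and $[y_1, Y] \in \mathbb{C}^{2n \times 2n}$ with
$\xi_1, y_1 \in \mathbb{C}^{2n}$ such that

(2.2) $\begin{bmatrix} y_1^H \\ Y^H \end{bmatrix} A\, [\xi_1, X] = \begin{bmatrix} \alpha & s^H \\ 0 & L \end{bmatrix}, \quad \begin{bmatrix} y_1^H \\ Y^H \end{bmatrix} B\, [\xi_1, X] = \begin{bmatrix} \beta & t^H \\ 0 & N \end{bmatrix},$

where $L, N \in \mathbb{C}^{(2n-1) \times (2n-1)}$ and $\lambda_1 = \alpha\beta^{-1}$. Since $\lambda_1$ is supposed to be
simple, it is not an eigenvalue of $(L, N)$.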

For a given orthonormal matrix $Q \in \mathbb{C}^{n \times m}$ with $m < n$, define

(2.3) $W = \begin{bmatrix} Q & 0 \\ 0 & Q \end{bmatrix}$

and let $[Q, Q_\perp]$ be unitary with $Q_\perp \in \mathbb{C}^{n \times (n-m)}$. From now on, throughout the
paper, let $\theta_1$ be the acute angle between $x_1$ and the projection subspace
$\mathrm{span}\{Q\}$ and

(2.4) $q_1 = Q^H x_1, \quad q_1^\perp = Q_\perp^H x_1.$

Then it holds that [25, p. 249, Theorem 2.2]

(2.5) $\|q_1^\perp\| = \sin\theta_1, \quad \|q_1\| = \sqrt{1 - \sin^2\theta_1} = \cos\theta_1.$

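First of all, we want to show that there is a q-Ritz value $\mu_1$ that converges to
$\lambda_1$ unconditionally when $\sin\theta_1 \to 0$. The first result towards this aim is the
following perturbation theorem.

Theorem 2.1. Let $\lambda_1$, $q_1$ and $\theta_1$ be defined as above, let $\widehat{M}$, $\widehat{D}$ and $\widehat{K}$ be
defined in (1.4), and let $\hat{q}_1 = q_1/\|q_1\|$. Then there are perturbation matrices
$\mathcal{E}_{\widehat{M}}, \mathcal{E}_{\widehat{D}}, \mathcal{E}_{\widehat{K}} \in \mathbb{C}^{m \times m}$ with

(2.6a) $\|\mathcal{E}_{\widehat{M}}\| \le \frac{1}{3}\left(m_0 + \frac{1}{|\lambda_1|} d_0 + \frac{1}{|\lambda_1|^2} k_0\right)\tan\theta_1,$
(2.6b) $\|\mathcal{E}_{\widehat{D}}\| \le \frac{1}{3}\left(|\lambda_1| m_0 + d_0 + \frac{1}{|\lambda_1|} k_0\right)\tan\theta_1,$
(2.6c) $\|\mathcal{E}_{\widehat{K}}\| \le \frac{1}{3}\left(|\lambda_1|^2 m_0 + |\lambda_1| d_0 + k_0\right)\tan\theta_1,$

such that $(\lambda_1, \hat{q}_1)$ is an exact eigenpair of the perturbed triple
$(\widehat{M} + \mathcal{E}_{\widehat{M}}, \widehat{D} + \mathcal{E}_{\widehat{D}}, \widehat{K} + \mathcal{E}_{\widehat{K}})$, where

(2.7) $m_0 = \|M\|, \quad d_0 = \|D\|, \quad k_0 = \|K\|.$

Proof. Recalling (2.4) and (2.5), since

$0 = (\lambda_1^2 M + \lambda_1 D + K)x_1 = (\lambda_1^2 M + \lambda_1 D + K)\,[Q, Q_\perp] \begin{bmatrix} q_1 \\ q_1^\perp \end{bmatrix},$

we obtain

(2.8) $\lambda_1^2 MQq_1 + \lambda_1 DQq_1 + KQq_1 = -(\lambda_1^2 M + \lambda_1 D + K)Q_\perp q_1^\perp.$

Pre-multiplying (2.8) by $Q^H/\|q_1\|$ gives

(2.9) $r_1 = (\lambda_1^2 \widehat{M} + \lambda_1 \widehat{D} + \widehat{K})\hat{q}_1 = -(\lambda_1^2 Q^H M + \lambda_1 Q^H D + Q^H K)\, Q_\perp q_1^\perp / \|q_1\|.$

So we have

$\|r_1\| \le (|\lambda_1|^2 m_0 + |\lambda_1| d_0 + k_0)\tan\theta_1.$

Define

$\mathcal{E}_{\widehat{M}} = -\frac{1}{3\lambda_1^2}\, r_1 \hat{q}_1^H, \quad \mathcal{E}_{\widehat{D}} = -\frac{1}{3\lambda_1}\, r_1 \hat{q}_1^H, \quad \mathcal{E}_{\widehat{K}} = -\frac{1}{3}\, r_1 \hat{q}_1^H.$

By (2.9) it is easily seen that $\|\mathcal{E}_{\widehat{M}}\|$, $\|\mathcal{E}_{\widehat{D}}\|$ and $\|\mathcal{E}_{\widehat{K}}\|$ satisfy (2.6) and

$\left(\lambda_1^2(\widehat{M} + \mathcal{E}_{\widehat{M}}) + \lambda_1(\widehat{D} + \mathcal{E}_{\widehat{D}}) + (\widehat{K} + \mathcal{E}_{\widehat{K}})\right)\hat{q}_1 = 0.$

□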
We may deduce from this theorem that there exists an eigenvalue of $(\widehat{M}, \widehat{D}, \widehat{K})$
that converges to $\lambda_1$ as $\theta_1 \to 0$. However, things are subtle and by no means
trivial here. The difficulty is that, unlike a usual matrix perturbation problem
where the matrices are given and fixed and only the perturbations are allowed to
change, here the matrix triple $(\widehat{M}, \widehat{D}, \widehat{K})$ and the perturbation triple
$(\mathcal{E}_{\widehat{M}}, \mathcal{E}_{\widehat{D}}, \mathcal{E}_{\widehat{K}})$ change simultaneously as $\theta_1 \to 0$. This means that there may
be a possibility that, as $\theta_1$ changes, the eigenvalue $\lambda_1$ of
$(\widehat{M} + \mathcal{E}_{\widehat{M}}, \widehat{D} + \mathcal{E}_{\widehat{D}}, \widehat{K} + \mathcal{E}_{\widehat{K}})$ and the eigenvalues of $(\widehat{M}, \widehat{D}, \widehat{K})$ become
ill conditioned so swiftly that no eigenvalue of $(\widehat{M}, \widehat{D}, \widehat{K})$ converges to $\lambda_1$
though $\theta_1 \to 0$.

Fortunately, by exploiting a theorem of Elsner [4] (also see [26, p. 168]) we can
prove that this cannot happen and there is indeed an eigenvalue that converges
to the desired $\lambda_1$ provided that $\theta_1 \to 0$. Elsner's theorem states that, given
matrices $C$ and $\widetilde{C}$ of order $n$, for any eigenvalue $\lambda$ of $C$ there is an eigenvalue
$\tilde{\lambda}$ of $\widetilde{C}$ such that

$|\tilde{\lambda} - \lambda| \le (\|C\| + \|\widetilde{C}\|)^{1 - 1/n}\, \|C - \widetilde{C}\|^{1/n}.$
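
To see why a bound with the exponent $1/n$ cannot be improved in general, consider a nilpotent Jordan block perturbed in its corner entry. The following is a minimal sketch, assuming NumPy is available; the matrices `C` and `C_tilde` and the value of `eps` are our own illustrative choices.

```python
import numpy as np

n, eps = 4, 1e-8
C = np.diag(np.ones(n - 1), k=1)   # nilpotent Jordan block
C_tilde = C.copy()
C_tilde[n - 1, 0] = eps            # eigenvalues become the n-th roots of eps

lam = np.zeros(n)                  # all eigenvalues of C are zero (known analytically)
lam_tilde = np.linalg.eigvals(C_tilde)

# Largest distance from an eigenvalue of C_tilde to the nearest eigenvalue of C.
variation = max(min(abs(lt - l) for l in lam) for lt in lam_tilde)

def norm2(A):
    """Spectral norm."""
    return np.linalg.norm(A, 2)

elsner_bound = (norm2(C) + norm2(C_tilde)) ** (1 - 1 / n) * norm2(C - C_tilde) ** (1 / n)
```

Here the eigenvalues move by $\varepsilon^{1/n} = 10^{-2}$ although the perturbation has norm $10^{-8}$, and the bound is attained up to the modest factor $2^{3/4}$. This worst case is rare in practice, but it is the price paid for a bound that makes no assumption on the conditioning of the eigenvalues.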



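For our purpose, define the matrices $\widehat{A}$ and $\widehat{B}$ by

(2.10) $\widehat{A} = \begin{bmatrix} -\widehat{D} & -\widehat{K} \\ I_m & 0 \end{bmatrix}, \quad \widehat{B} = \begin{bmatrix} \widehat{M} & 0 \\ 0 & I_m \end{bmatrix}.$

Then the eigenvalues $\mu$ of $(\widehat{M}, \widehat{D}, \widehat{K})$ are equal to those of $(\widehat{A}, \widehat{B})$, whose
normalized eigenvectors are $\hat{\xi} = \begin{bmatrix} \mu\hat{x} \\ \hat{x} \end{bmatrix} / \sqrt{1 + |\mu|^2}$ with $\hat{x}$ the eigenvectors
associated with the eigenvalues $\mu$ of $(\widehat{M}, \widehat{D}, \widehat{K})$. Since $\widehat{M}$ is Hermitian positive
definite, so is $\widehat{B}$. Therefore, all the $\mu$ are the eigenvalues of $\widehat{B}^{-1}\widehat{A}$.
Furthermore, it holds that $\|\widehat{B}^{-1}\| \le \|B^{-1}\|$ for any given orthonormal $Q$ and
Hermitian positive definite $M$.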

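From Theorem 2.1, $\lambda_1$ is an eigenvalue of $(\widehat{A} + \mathcal{E}_{\widehat{A}}, \widehat{B} + \mathcal{E}_{\widehat{B}})$ with the
perturbation matrices

$\mathcal{E}_{\widehat{A}} = \begin{bmatrix} -\mathcal{E}_{\widehat{D}} & -\mathcal{E}_{\widehat{K}} \\ 0 & 0 \end{bmatrix}, \quad \mathcal{E}_{\widehat{B}} = \begin{bmatrix} \mathcal{E}_{\widehat{M}} & 0 \\ 0 & 0 \end{bmatrix},$

i.e., an eigenvalue of $(\widehat{B} + \mathcal{E}_{\widehat{B}})^{-1}(\widehat{A} + \mathcal{E}_{\widehat{A}})$ if $(\widehat{B} + \mathcal{E}_{\widehat{B}})^{-1}$ exists. Since $\widehat{B}$
is Hermitian positive definite and its smallest singular value is bounded by that
of $B$ from below, $\widehat{B} + \mathcal{E}_{\widehat{B}}$ must be nonsingular for $\theta_1$ small enough. Moreover, for
$\theta_1 \to 0$, it follows from Theorem 2.1 that

(2.11) $\|(\widehat{B} + \mathcal{E}_{\widehat{B}})^{-1}\| = \|\widehat{B}^{-1} + O(\|\mathcal{E}_{\widehat{B}}\|)\| \to \|\widehat{B}^{-1}\| \le \|B^{-1}\|$

is uniformly bounded independent of $\theta_1$. Since $\|\widehat{A}\|$ is always bounded from above
as $\|\widehat{D}\| \le \|D\|$ and $\|\widehat{K}\| \le \|K\|$, it follows that $\|\widehat{B}^{-1}\widehat{A}\| \le \|\widehat{B}^{-1}\|\,\|\widehat{A}\|$ is uniformly
bounded independent of $\theta_1$. As a result, for $\theta_1 \to 0$, since $\widehat{A} + \mathcal{E}_{\widehat{A}} \to \widehat{A}$, it
follows from (2.11) and Theorem 2.1 that

$\|(\widehat{B} + \mathcal{E}_{\widehat{B}})^{-1}(\widehat{A} + \mathcal{E}_{\widehat{A}})\| \le \|(\widehat{B} + \mathcal{E}_{\widehat{B}})^{-1}\|\,\|\widehat{A} + \mathcal{E}_{\widehat{A}}\|$

is uniformly bounded independent of $\theta_1$.

Finally, from Theorem 2.1 and $(\widehat{B} + \mathcal{E}_{\widehat{B}})^{-1} = \widehat{B}^{-1} + O(\|\mathcal{E}_{\widehat{B}}\|)$, it is easily
justified that

$\|\widehat{B}^{-1}\widehat{A} - (\widehat{B} + \mathcal{E}_{\widehat{B}})^{-1}(\widehat{A} + \mathcal{E}_{\widehat{A}})\| = O(\sin\theta_1).$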
Based on Elsner's theorem, we have the following corollary, which, together with
the above discussions, proves the global unconditional convergence of q-Ritz values
when $\theta_1 \to 0$.

Corollary 2.2. Assume that $\theta_1$ is small enough to make $\widehat{B} + \mathcal{E}_{\widehat{B}}$ nonsingular.
There is a q-Ritz value $\mu_1$ such that

(2.12) $|\mu_1 - \lambda_1| \le \left(\|\widehat{B}^{-1}\widehat{A}\| + \|(\widehat{B} + \mathcal{E}_{\widehat{B}})^{-1}(\widehat{A} + \mathcal{E}_{\widehat{A}})\|\right)^{1 - \frac{1}{2m}} \|\widehat{B}^{-1}\widehat{A} - (\widehat{B} + \mathcal{E}_{\widehat{B}})^{-1}(\widehat{A} + \mathcal{E}_{\widehat{A}})\|^{\frac{1}{2m}}.$

The corollary indicates that as $\theta_1 \to 0$ there is always a q-Ritz value
$\mu_1 \to \lambda_1$ unconditionally. We should comment that bound (2.12) is in general a
gross overestimate that accounts for the worst case. If, as usually happens in
practice, the condition number of $\lambda_1$ as an eigenvalue of
$(\widehat{B} + \mathcal{E}_{\widehat{B}})^{-1}(\widehat{A} + \mathcal{E}_{\widehat{A}})$ is bounded, the convergence will be linear in $\theta_1$, much
better than that predicted by bound (2.12).

Next, we analyze the convergence of the corresponding q-Ritz vector $\tilde{x}_1$. Based
on decomposition (2.2), we can establish the following theorem, which is an
analogue of Theorem 3.1 in [9] for the standard linear eigenvalue problem. The
result will be used when we prove the unconditional convergence of refined q-Ritz
vectors to be introduced in the next section.

Theorem 2.3. Let $(\tilde{\mu}_1, \tilde{\xi}_1)$ with $\|\tilde{\xi}_1\| = 1$ be an approximation to the eigenpair
$(\lambda_1, \xi_1)$ of the matrix pair $(A, B)$ with $\|\xi_1\| = 1$. Let

(2.13) $r = A\tilde{\xi}_1 - \tilde{\mu}_1 B\tilde{\xi}_1$

be the residual of $(\tilde{\mu}_1, \tilde{\xi}_1)$, and define
$\mathrm{sep}(\tilde{\mu}_1, (L, N)) := \|(L - \tilde{\mu}_1 N)^{-1}\|^{-1}$. Then

(2.14) $\sin\angle(\xi_1, \tilde{\xi}_1) \le \dfrac{\|r\|}{\mathrm{sep}(\tilde{\mu}_1, (L, N))}.$

Proof. From (2.2), pre-multiplying (2.13) by $Y^H$ leads to

$Y^H r = Y^H(\alpha y_1 \xi_1^H + y_1 s^H X^H + Y L X^H)\tilde{\xi}_1 - \tilde{\mu}_1 Y^H(\beta y_1 \xi_1^H + y_1 t^H X^H + Y N X^H)\tilde{\xi}_1$
$\phantom{Y^H r} = (L - \tilde{\mu}_1 N) X^H \tilde{\xi}_1.$

Therefore, it follows from $\|X^H \tilde{\xi}_1\| = \sin\angle(\xi_1, \tilde{\xi}_1)$ that (2.14) holds. □

In terms of the a posteriori computable residual $r$, Theorem 2.3 establishes the
relationship between the eigenvector and its approximation for the generalized
eigenvalue problem (2.1).

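Let $(\mu_1, \hat{x}_1)$ with $\|\hat{x}_1\| = 1$ be the eigenpair of $(\widehat{M}, \widehat{D}, \widehat{K})$ that is supposed to
approximate the desired eigenpair $(\lambda_1, x_1)$ of $(M, D, K)$. In terms of $\theta_1$, we
attempt to derive one of our main results, an a priori bound for the q-Ritz vector
$\tilde{x}_1 = Q\hat{x}_1$ as an approximation to the eigenvector $x_1$. Note that $\mu_1$ is an
eigenvalue of $(\widehat{A}, \widehat{B})$ and $\hat{\xi}_1 = \begin{bmatrix} \mu_1 \hat{x}_1 \\ \hat{x}_1 \end{bmatrix} / \sqrt{1 + |\mu_1|^2}$ is its corresponding
normalized eigenvector. Similar to (2.2), there are unitary matrices $[\hat{\xi}_1, \widehat{X}]$ and
$[\hat{y}_1, \widehat{Y}] \in \mathbb{C}^{2m \times 2m}$ with $\hat{\xi}_1, \hat{y}_1 \in \mathbb{C}^{2m}$ such that

(2.15) $\begin{bmatrix} \hat{y}_1^H \\ \widehat{Y}^H \end{bmatrix} \widehat{A}\, [\hat{\xi}_1, \widehat{X}] = \begin{bmatrix} \hat{\alpha} & \hat{s}^H \\ 0 & \widehat{L} \end{bmatrix}, \quad \begin{bmatrix} \hat{y}_1^H \\ \widehat{Y}^H \end{bmatrix} \widehat{B}\, [\hat{\xi}_1, \widehat{X}] = \begin{bmatrix} \hat{\beta} & \hat{t}^H \\ 0 & \widehat{N} \end{bmatrix},$

where $\widehat{L}, \widehat{N} \in \mathbb{C}^{(2m-1) \times (2m-1)}$ and $\mu_1 = \hat{\alpha}\hat{\beta}^{-1}$. Under the only hypothesis that
$\sin\theta_1 \to 0$, it is possible that there is an eigenvalue of $(\widehat{L}, \widehat{N})$ that could be
arbitrarily near or even equal to $\mu_1$. For a multiple and derogatory $\mu_1$, that is,
one that has more than one trivial or nontrivial Jordan block, there is more than
one $\tilde{x}_1 = Q\hat{x}_1$ to approximate the unique eigenvector $x_1$ of $(M, D, K)$. In this
case, the method itself cannot tell us which one or which linear combination of
them is meaningful, so that it fails. If $\mu_1$ is near an eigenvalue of $(\widehat{L}, \widehat{N})$, we
will get a unique $\tilde{x}_1$, but there is no guarantee that it converges to $x_1$. This
leads us to postulate that $\tilde{x}_1$ will converge provided that $\mathrm{sep}(\lambda_1, (\widehat{L}, \widehat{N}))$ is
uniformly away from zero independent of $\theta_1$, i.e., $\mathrm{sep}(\lambda_1, (\widehat{L}, \widehat{N})) \ge c$ with $c$ a
positive constant independent of $\theta_1$. We will show quantitatively that this is
indeed the case. Before proceeding, we need the following lemma.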





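Lemma 2.4. Let $u = \begin{bmatrix} u_2 \\ u_1 \end{bmatrix}$ and $\tilde{u} = \begin{bmatrix} \tilde{u}_2 \\ \tilde{u}_1 \end{bmatrix}$, where $u_i, \tilde{u}_i \in \mathbb{C}^n$ for $i = 1, 2$
and $\|u_1\| = \|\tilde{u}_1\| = 1$. Then

$\sin\angle(u_1, \tilde{u}_1) \le \min\{\|u\|, \|\tilde{u}\|\}\, \sin\angle(u, \tilde{u}).$

Proof. Since $\|u_1\| = \|\tilde{u}_1\| = 1$, from the definition of $\sin\angle(u, \tilde{u})$, we have

$\sin^2\angle(u, \tilde{u}) = \min_{\alpha} \left\| \dfrac{u}{\|u\|} - \alpha\tilde{u} \right\|^2
 = \min_{\alpha} \left( \left\| \dfrac{u_2}{\|u\|} - \alpha\tilde{u}_2 \right\|^2 + \left\| \dfrac{u_1}{\|u\|} - \alpha\tilde{u}_1 \right\|^2 \right)$
$\phantom{\sin^2\angle(u, \tilde{u})} \ge \min_{\alpha} \left\| \dfrac{u_1}{\|u\|} - \alpha\tilde{u}_1 \right\|^2
 = \dfrac{1}{\|u\|^2} \min_{\alpha} \|u_1 - \alpha\tilde{u}_1\|^2
 = \dfrac{1}{\|u\|^2} \sin^2\angle(u_1, \tilde{u}_1).$

In the same way, we can also prove that

$\sin\angle(u_1, \tilde{u}_1) \le \|\tilde{u}\|\, \sin\angle(u, \tilde{u}).$

Therefore, the assertion holds.

□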
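
The inequality of Lemma 2.4 is easy to test numerically. The following is a minimal sketch, assuming NumPy is available; the helper names `sin_angle` and `unit` and the random complex vectors are ours and only illustrate the statement.

```python
import numpy as np

def sin_angle(a, b):
    """Sine of the acute angle between the lines spanned by a and b."""
    a = a / np.linalg.norm(a)
    b = b / np.linalg.norm(b)
    # Norm of the component of a orthogonal to b.
    return np.linalg.norm(a - b * np.vdot(b, a))

def unit(v):
    return v / np.linalg.norm(v)

rng = np.random.default_rng(2)
n = 5
u1 = unit(rng.standard_normal(n) + 1j * rng.standard_normal(n))
ut1 = unit(rng.standard_normal(n) + 1j * rng.standard_normal(n))
u2 = rng.standard_normal(n) + 1j * rng.standard_normal(n)
ut2 = rng.standard_normal(n) + 1j * rng.standard_normal(n)

u = np.concatenate([u2, u1])       # u = [u_2; u_1] with ||u_1|| = 1
ut = np.concatenate([ut2, ut1])    # same block structure for the second vector

lhs = sin_angle(u1, ut1)
rhs = min(np.linalg.norm(u), np.linalg.norm(ut)) * sin_angle(u, ut)
```

In the applications below, the stacked vectors have the form $[\mu x; x]$ with $\|x\| = 1$, so the factor $\min\{\|u\|, \|\tilde{u}\|\}$ is at most $\sqrt{1 + |\mu|^2}$.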



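Theorem 2.5. Let $(\widehat{A}, \widehat{B})$ be defined in (2.10) and have decomposition (2.15).
Suppose that the q-Ritz pair $(\mu_1, \tilde{x}_1)$ is used to approximate the desired eigenpair
$(\lambda_1, x_1)$ with $\|x_1\| = \|\tilde{x}_1\| = 1$. If $\mathrm{sep}(\lambda_1, (\widehat{L}, \widehat{N})) > 0$, then

(2.16) $\sin\angle(x_1, \tilde{x}_1) \le \sin\theta_1 + \dfrac{|\lambda_1|^2 m_0 + |\lambda_1| d_0 + k_0}{\mathrm{sep}(\lambda_1, (\widehat{L}, \widehat{N}))}\, \tan\theta_1,$

where $m_0$, $d_0$ and $k_0$ are defined in (2.7).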

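Proof. By the triangle inequality we have

(2.17) $\angle(x_1, \tilde{x}_1) \le \angle(x_1, QQ^H x_1) + \angle(QQ^H x_1, \tilde{x}_1).$

From (2.4) and (2.5), we have

(2.18) $\cos\angle(x_1, QQ^H x_1) = \dfrac{|x_1^H QQ^H x_1|}{\|QQ^H x_1\|} = \|Q^H x_1\| = \cos\theta_1.$

Let $\hat{q}_1 = q_1/\|q_1\|$. From (2.17) and (2.18) we get

$\sin\angle(x_1, \tilde{x}_1) \le \sin\theta_1 + \sin\angle(QQ^H x_1, \tilde{x}_1) = \sin\theta_1 + \sin\angle(Qq_1, Q\hat{x}_1)$
(2.19) $\phantom{\sin\angle(x_1, \tilde{x}_1)} = \sin\theta_1 + \sin\angle(\hat{x}_1, \hat{q}_1).$

From (2.10), it is easily seen that $\left(\mu_1, \begin{bmatrix} \mu_1 \hat{x}_1 \\ \hat{x}_1 \end{bmatrix}\right)$ is an eigenpair of $(\widehat{A}, \widehat{B})$.
So we can regard $\left(\lambda_1, \tilde{q} = \begin{bmatrix} \lambda_1 \hat{q}_1 \\ \hat{q}_1 \end{bmatrix}\right)$ as an approximation of it. Then the
residual of $(\lambda_1, \tilde{q}/\|\tilde{q}\|)$ as an approximate eigenpair of $(\widehat{A}, \widehat{B})$ is

$r = \dfrac{1}{\|\tilde{q}\|}\left( \begin{bmatrix} -\widehat{D} & -\widehat{K} \\ I_m & 0 \end{bmatrix} \begin{bmatrix} \lambda_1 \hat{q}_1 \\ \hat{q}_1 \end{bmatrix} - \lambda_1 \begin{bmatrix} \widehat{M} & 0 \\ 0 & I_m \end{bmatrix} \begin{bmatrix} \lambda_1 \hat{q}_1 \\ \hat{q}_1 \end{bmatrix} \right) = -\dfrac{1}{\|\tilde{q}\|} \begin{bmatrix} (\lambda_1^2 \widehat{M} + \lambda_1 \widehat{D} + \widehat{K})\hat{q}_1 \\ 0 \end{bmatrix}.$

By (2.9) in the proof of Theorem 2.1 we have

$\|r\| = \dfrac{\|r_1\|}{\|\tilde{q}\|} \le \dfrac{|\lambda_1|^2 m_0 + |\lambda_1| d_0 + k_0}{\|\tilde{q}\|}\, \tan\theta_1.$

From Lemma 2.4, Theorem 2.3 and (2.19), inequality (2.16) follows:

(2.20) $\sin\angle(x_1, \tilde{x}_1) \le \sin\theta_1 + \sin\angle(\hat{x}_1, \hat{q}_1)$
$\phantom{(2.20) \sin\angle(x_1, \tilde{x}_1)} \le \sin\theta_1 + \|\tilde{q}\|\, \sin\angle(\hat{\xi}_1, \tilde{q})$
$\phantom{(2.20) \sin\angle(x_1, \tilde{x}_1)} \le \sin\theta_1 + \dfrac{\|r\|\,\|\tilde{q}\|}{\mathrm{sep}(\lambda_1, (\widehat{L}, \widehat{N}))}$
$\phantom{(2.20) \sin\angle(x_1, \tilde{x}_1)} \le \sin\theta_1 + \dfrac{|\lambda_1|^2 m_0 + |\lambda_1| d_0 + k_0}{\mathrm{sep}(\lambda_1, (\widehat{L}, \widehat{N}))}\, \tan\theta_1.$

□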



From Theorem 2.5 we see that $\mathrm{sep}(\lambda_1, (\widehat{L}, \widehat{N})) > 0$ uniformly is a sufficient
condition for the convergence of the q-Ritz vector $\tilde{x}_1$. Furthermore, from
Theorem 2.1, since the q-Ritz value $\mu_1$ approaches the eigenvalue $\lambda_1$ as $\theta_1 \to 0$,
by the continuity argument we have $\mathrm{sep}(\mu_1, (\widehat{L}, \widehat{N})) \to \mathrm{sep}(\lambda_1, (\widehat{L}, \widehat{N}))$. However,
as we have argued above, $\mathrm{sep}(\mu_1, (\widehat{L}, \widehat{N}))$ can be arbitrarily small (and even be
exactly zero) when $\mu_1$ is arbitrarily near other eigenvalues (or is associated
with a multiple eigenvalue) of $(\widehat{L}, \widehat{N})$. Consequently, while the q-Ritz value
converges unconditionally once $\theta_1 \to 0$, the corresponding q-Ritz vector may fail to
converge or may converge very slowly or irregularly.

In the following, we give an example to illustrate that the q-Ritz vector can
fail to converge to the desired eigenvector.

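Example 2.6. Consider QEP (1.1) with

$M = \begin{bmatrix} 1 & 1 & 0 \\ 1 & 2 & 1 \\ 0 & 1 & 2 \end{bmatrix}, \quad D = \begin{bmatrix} -5.5 & -5 & 0 \\ -5 & -11 & -3 \\ 0 & -3 & -4 \end{bmatrix}, \quad K = \begin{bmatrix} 6 & 6 & 0 \\ 6 & 9 & 2 \\ 0 & 2 & 2 \end{bmatrix}.$

It is easy to see that $M$ and $K$ are symmetric positive definite and
$(1, [0, 0, 1]^T)$ is an eigenpair of the QEP.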
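Suppose that we have come up with an orthonormal basis

$Q = \begin{bmatrix} 0 & \frac{8}{\sqrt{73}} \\ 0 & -\frac{3}{\sqrt{73}} \\ 1 & 0 \end{bmatrix}.$

Then we have $\sin\theta_1 = 0$ exactly, and the projection matrices are

$\widehat{M} = Q^H M Q = \begin{bmatrix} 2 & -\frac{3}{\sqrt{73}} \\ -\frac{3}{\sqrt{73}} & \frac{34}{73} \end{bmatrix}, \quad
\widehat{D} = Q^H D Q = \begin{bmatrix} -4 & \frac{9}{\sqrt{73}} \\ \frac{9}{\sqrt{73}} & -\frac{211}{73} \end{bmatrix}, \quad
\widehat{K} = Q^H K Q = \begin{bmatrix} 2 & -\frac{6}{\sqrt{73}} \\ -\frac{6}{\sqrt{73}} & \frac{177}{73} \end{bmatrix},$

from which it follows that

$\widehat{M} + \widehat{D} + \widehat{K} = 0.$

Since $\widehat{M} + \widehat{D} + \widehat{K}$ is zero, any nonzero vector $\hat{x}_1$ with $\|\hat{x}_1\| = 1$ is an eigenvector of
$(\widehat{M}, \widehat{D}, \widehat{K})$ corresponding to the double eigenvalue one, a q-Ritz value equal to the
desired eigenvalue exactly. However, the q-Rayleigh-Ritz method itself cannot tell
us how to pick a suitable $\hat{x}_1$. In practice, we might well take
$\hat{x}_1 = [1/\sqrt{2}, 1/\sqrt{2}]^T$, and then the approximate eigenvector becomes
$\tilde{x}_1 = [4\sqrt{2}/\sqrt{73}, -3/\sqrt{146}, 1/\sqrt{2}]^T$, which has no accuracy as an approximation of
the desired eigenvector $[0, 0, 1]^T$ and is completely wrong. Thus the method can
fail even though the projection subspace $\mathrm{span}\{Q\}$ contains the desired
eigenvector exactly.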
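
The exact-subspace case of Example 2.6 can be reproduced directly. The following is a minimal sketch, assuming NumPy is available; the variable names are ours, and the choice $\hat{x}_1 = [1/\sqrt{2}, 1/\sqrt{2}]^T$ is the one made in the text.

```python
import numpy as np

M = np.array([[1.0, 1, 0], [1, 2, 1], [0, 1, 2]])
D = np.array([[-5.5, -5, 0], [-5, -11, -3], [0, -3, -4]])
K = np.array([[6.0, 6, 0], [6, 9, 2], [0, 2, 2]])
s = np.sqrt(73.0)
Q = np.array([[0, 8 / s], [0, -3 / s], [1, 0]])   # orthonormal basis of the example
x1 = np.array([0.0, 0.0, 1.0])                    # desired eigenvector, lambda_1 = 1

# Projected matrices (1.4); their sum is the projected pencil evaluated at mu = 1.
Mh, Dh, Kh = (Q.T @ X @ Q for X in (M, D, K))
projected_at_one = Mh + Dh + Kh

# One admissible unit eigenvector of the projected QEP and its q-Ritz vector.
xh = np.array([1.0, 1.0]) / np.sqrt(2.0)
x_ritz = Q @ xh
# Sine of the angle between x1 and the q-Ritz vector.
sin_err = np.linalg.norm(x_ritz - x1 * (x1 @ x_ritz))
```

The projected pencil vanishes at $\mu = 1$ up to rounding, so every unit vector is an eigenvector there, and this particular choice makes an angle of 45 degrees with $x_1$ ($\sin\angle(x_1, \tilde{x}_1) = 1/\sqrt{2}$).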

In practice, we would not expect $\mathrm{span}\{Q\}$ to contain $x_1$ exactly. Let us
investigate the case that $\mathrm{span}\{Q\}$ contains a sufficiently accurate approximation
to $x_1$, i.e., $\sin\theta_1$ is very small. We perturb $Q$ by a matrix generated randomly
in a normal distribution by $10^{-12} \times \mathrm{randn}(3,2)$ whose 2-norm is $2.2 \times 10^{-12}$, and
the resulting $\sin\theta_1 = 1.7 \times 10^{-12}$. The orthonormalized

$Q := Q(Q^H Q)^{-1/2} = \begin{bmatrix} -0.000000000001074 & 0.936329177568703 \\ -0.000000000001425 & -0.351123441589302 \\ 1.000000000000000 & 0.000000000000506 \end{bmatrix}$

and

$\widehat{M} = \begin{bmatrix} 1.999999999997149 & -0.351123441589253 \\ -0.351123441589253 & 0.465753424656353 \end{bmatrix},$

$\widehat{D} = \begin{bmatrix} -3.999999999991449 & 1.053370324770698 \\ 1.053370324770698 & -2.890410958899234 \end{bmatrix},$

$\widehat{K} = \begin{bmatrix} 1.999999999997149 & -0.351123441589253 \\ -0.351123441589253 & 0.465753424656353 \end{bmatrix}.$

We use the Matlab function polyeig.m to solve the projected QEP, and the computed
$\mu_1 = 1.000000000009369$ and the associated eigenvector

$\hat{x}_1 = [0.999982126253304, -0.005978894038382]^T.$

So the q-Ritz vector

$\tilde{x}_1 = Q\hat{x}_1 = [-0.005598212938803, 0.002099329850230, 0.999982126253300]^T$

and

$\sin\angle(x_1, \tilde{x}_1) \approx 0.005979,$

at least nine orders bigger than $\sin\theta_1$! So $\tilde{x}_1$ is a very poor approximation to
$x_1$ for the given accurate subspace $\mathrm{span}\{Q\}$. It is also verified that the
residual norm of the q-Ritz pair is

$\|(\mu_1^2 M + \mu_1 D + K)\tilde{x}_1\| \approx 0.011958.$

The poor accuracy of $\tilde{x}_1$ is due to the fact that there is another q-Ritz value
$\mu = 1.000000000010143$ that is very near to $\mu_1$, so that $\mathrm{sep}(\lambda_1, (\widehat{L}, \widehat{N}))$ in (2.16)
is tiny.



3. Convergence of refined q-Ritz vectors 

As we have seen in Section 2, the q-Ritz vector may fail to converge or may
converge very slowly. Since the q-Ritz value is known to converge to the simple
eigenvalue $\lambda_1$ when $\sin\theta_1 \to 0$, this suggests that we deal with a non-converging
q-Ritz vector by retaining the q-Ritz value but replacing the q-Ritz vector with a
unit length vector $\tilde{z}_1 \in \mathrm{span}\{Q\}$ with a suitably small residual. Naturally, for a
given q-Ritz value $\mu_1$ we construct $\tilde{z}_1 = Q\hat{z}_1$, where the unit length $\hat{z}_1$ is
required to be the optimal solution

(3.1) $\hat{z}_1 = \arg\min_{\|z\| = 1} \|(\mu_1^2 M + \mu_1 D + K)Qz\|.$

The vector $\tilde{z}_1 = Q\hat{z}_1$ is called a refined q-Ritz vector of $(M, D, K)$ corresponding
to $\mu_1$ with respect to $\mathrm{span}\{Q\}$. Obviously, $\hat{z}_1$ is the right singular vector of the
$n \times m$ rectangular matrix $(\mu_1^2 M + \mu_1 D + K)Q$ associated with its smallest singular
value. We can compute $\hat{z}_1$ reliably by a standard SVD algorithm or by generally
cheaper but still numerically stable cross-product based SVD algorithms; see
[11, 17] and also [25]. For a detailed round-off error analysis of the latter ones,
we refer to [14].
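
Both ways of computing $\hat{z}_1$ can be sketched as follows. This is a minimal illustration, assuming NumPy is available; the function name `refined_q_ritz_vector` is ours, and the cross-product branch simply takes the eigenvector of the $m \times m$ Hermitian cross-product matrix for its smallest eigenvalue, which is one simple reading of a cross-product based algorithm and not necessarily the variant analyzed in the cited references.

```python
import numpy as np

def refined_q_ritz_vector(M, D, K, Q, mu, cross_product=False):
    """Return (z_tilde, z_hat) solving (3.1) for a given approximate eigenvalue mu."""
    S = (mu**2 * M + mu * D + K) @ Q                   # n x m matrix in (3.1)
    if cross_product:
        # Eigenvalues of S^H S are returned in ascending order.
        _, V = np.linalg.eigh(S.conj().T @ S)
        z_hat = V[:, 0]
    else:
        # Singular values are returned in descending order; rows of Vh are v_i^H.
        _, _, Vh = np.linalg.svd(S, full_matrices=False)
        z_hat = Vh[-1].conj()
    return Q @ z_hat, z_hat

# Hypothetical test problem and a hypothetical approximate eigenvalue mu.
rng = np.random.default_rng(3)
n, m = 8, 3
G = rng.standard_normal((n, n))
M = G @ G.T + n * np.eye(n)
D = rng.standard_normal((n, n))
K = rng.standard_normal((n, n))
Q, _ = np.linalg.qr(rng.standard_normal((n, m)))
mu = 0.3 + 0.1j

z_svd, zh_svd = refined_q_ritz_vector(M, D, K, Q, mu)
z_cp, zh_cp = refined_q_ritz_vector(M, D, K, Q, mu, cross_product=True)

S = (mu**2 * M + mu * D + K) @ Q
res_refined = np.linalg.norm(S @ zh_svd)               # equals the smallest singular value
res_other = np.linalg.norm(S @ np.eye(m)[:, 0])        # residual of another unit vector
agreement = abs(np.vdot(zh_svd, zh_cp))                # 1 if both agree up to a phase
```

The two branches return the same vector up to a scalar of modulus one whenever the smallest singular value is simple; the cross-product branch works with an $m \times m$ matrix only.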

Before establishing the convergence of the refined q-Ritz vector zi , we need two 
lemmas. 



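Lemma 3.1. For $W$ defined in (2.3), let $(\lambda_1, x_1)$ with $\|x_1\| = 1$ be the desired
eigenpair of $(M, D, K)$ and $\xi_1 = \begin{bmatrix} \lambda_1 x_1 \\ x_1 \end{bmatrix} / \sqrt{1 + |\lambda_1|^2}$. Then it holds that

(3.2) $\sin\angle(\xi_1, \mathrm{span}\{W\}) = \sin\theta_1.$

Proof. By (2.3) and the definition of $\sin\theta_1$, we have

$\sin^2\angle(\xi_1, \mathrm{span}\{W\}) = \dfrac{1}{1 + |\lambda_1|^2} \min_{u, v \in \mathrm{span}\{Q\}} \left\| \begin{bmatrix} \lambda_1 x_1 \\ x_1 \end{bmatrix} - \begin{bmatrix} u \\ v \end{bmatrix} \right\|^2$
$\phantom{\sin^2\angle(\xi_1, \mathrm{span}\{W\})} = \dfrac{1}{1 + |\lambda_1|^2} \min_{u, v \in \mathrm{span}\{Q\}} \left( \|\lambda_1 x_1 - u\|^2 + \|x_1 - v\|^2 \right)$
$\phantom{\sin^2\angle(\xi_1, \mathrm{span}\{W\})} = \dfrac{|\lambda_1|^2}{1 + |\lambda_1|^2} \min_{u \in \mathrm{span}\{Q\}} \|x_1 - u\|^2 + \dfrac{1}{1 + |\lambda_1|^2} \min_{v \in \mathrm{span}\{Q\}} \|x_1 - v\|^2$
$\phantom{\sin^2\angle(\xi_1, \mathrm{span}\{W\})} = \dfrac{|\lambda_1|^2}{1 + |\lambda_1|^2} \sin^2\theta_1 + \dfrac{1}{1 + |\lambda_1|^2} \sin^2\theta_1 = \sin^2\theta_1.$

□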



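Lemma 3.2. Let $(A, B)$ be defined in (2.1b). It holds that

(3.3) $\min_{\|z\| = 1} \left\| (A - \mu_1 B) \begin{bmatrix} \mu_1 Qz \\ Qz \end{bmatrix} \Big/ \sqrt{1 + |\mu_1|^2} \right\| = \dfrac{1}{\sqrt{1 + |\mu_1|^2}} \min_{\|z\| = 1} \|(\mu_1^2 M + \mu_1 D + K)Qz\|$

and the minimum is attained at $\hat{z}_1$.

Proof. Without the minimizations, for any $m$ dimensional vector $z$, it is direct to
verify that the two sides are equal. So the assertion holds. □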
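
The identity behind Lemma 3.2 is $(A - \mu B)[\mu y; y] = -[(\mu^2 M + \mu D + K)y; 0]$ for any vector $y$, which follows from the block structure in (2.1b). The following is a minimal numerical check, assuming NumPy is available; the test matrices and the value of `mu` are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(4)
n, m = 6, 2
G = rng.standard_normal((n, n))
M = G @ G.T + n * np.eye(n)
D = rng.standard_normal((n, n))
K = rng.standard_normal((n, n))

# Linearization (2.1b).
A = np.block([[-D, -K], [np.eye(n), np.zeros((n, n))]])
B = np.block([[M, np.zeros((n, n))], [np.zeros((n, n)), np.eye(n)]])

Q, _ = np.linalg.qr(rng.standard_normal((n, m)))
mu = 0.4 - 0.9j
z = rng.standard_normal(m) + 1j * rng.standard_normal(m)
z = z / np.linalg.norm(z)
y = Q @ z

scale = np.sqrt(1 + abs(mu) ** 2)
lhs = np.linalg.norm((A - mu * B) @ np.concatenate([mu * y, y])) / scale
rhs = np.linalg.norm((mu**2 * M + mu * D + K) @ y) / scale
```

Because the two objective functions agree for every unit vector $z$, they share the same minimizer, which is why the refined q-Ritz vector can be analyzed through the linearized pair $(A, B)$.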

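Theorem 3.3. Let $\mu_1$ be the q-Ritz value of $(M, D, K)$ approximating the desired
simple eigenvalue $\lambda_1$. Suppose $\mathrm{sep}(\mu_1, (L, N)) > 0$, where $L, N$ are defined in
(2.2). Then we have

(3.4) $\sin\angle(x_1, \tilde{z}_1) \le \dfrac{\sqrt{1 + |\lambda_1|^2}\,\left(|\lambda_1 - \mu_1|\,\|B\| + \|A - \mu_1 B\| \sin\theta_1\right)}{\cos\theta_1\, \mathrm{sep}(\mu_1, (L, N))} + \dfrac{|\lambda_1 - \mu_1|\,\|A - \mu_1 B\|}{\mathrm{sep}(\mu_1, (L, N))}.$

Proof. Let $\xi_1 = \begin{bmatrix} \lambda_1 x_1 \\ x_1 \end{bmatrix} / \sqrt{1 + |\lambda_1|^2}$. From Lemma 2.4 we have

$\sin\angle(x_1, \tilde{z}_1) \le \sqrt{1 + |\mu_1|^2}\, \sin\angle\left( \begin{bmatrix} \lambda_1 x_1 \\ x_1 \end{bmatrix}, \begin{bmatrix} \mu_1 Q\hat{z}_1 \\ Q\hat{z}_1 \end{bmatrix} \right) = \sqrt{1 + |\mu_1|^2}\, \sin\angle(\xi_1, \tilde{z}),$

where $\tilde{z} = \begin{bmatrix} \mu_1 Q\hat{z}_1 \\ Q\hat{z}_1 \end{bmatrix} / \sqrt{1 + |\mu_1|^2}$. Let $P_W$ be the orthogonal projector onto the
subspace $\mathrm{span}\{W\}$, where $W = \mathrm{diag}(Q, Q)$. Then

$P_W \xi_1 = \dfrac{1}{\sqrt{1 + |\lambda_1|^2}} \begin{bmatrix} \lambda_1 QQ^H x_1 \\ QQ^H x_1 \end{bmatrix}.$

Therefore, with

$f_2 = \dfrac{1}{\sqrt{1 + |\lambda_1|^2}} \begin{bmatrix} (\lambda_1 - \mu_1) QQ^H x_1 \\ 0 \end{bmatrix},$

we get

$\tilde{w} = \dfrac{\sqrt{1 + |\lambda_1|^2}}{\sqrt{1 + |\mu_1|^2}\, \cos\theta_1}\, (P_W \xi_1 - f_2) = \dfrac{1}{\sqrt{1 + |\mu_1|^2}\, \cos\theta_1} \begin{bmatrix} \mu_1 QQ^H x_1 \\ QQ^H x_1 \end{bmatrix},$

which is an approximate eigenvector of the desired form in the left-hand side of
(3.3), so that $\tilde{w}$ is a minimizer candidate for (3.3). Define

$f = (I_{2n} - P_W)\xi_1 + f_2.$

Then from $\cos\theta_1 = \|Q^H x_1\|$ we have

$\|f_2\| = \dfrac{|\lambda_1 - \mu_1|\, \|QQ^H x_1\|}{\sqrt{1 + |\lambda_1|^2}} = \dfrac{|\lambda_1 - \mu_1| \cos\theta_1}{\sqrt{1 + |\lambda_1|^2}}.$

From Lemma 3.1 we get $\|(I_{2n} - P_W)\xi_1\| = \sin\theta_1$. Since $(A - \lambda_1 B)\xi_1 = 0$ and
$P_W \xi_1 - f_2 = \xi_1 - f$, we obtain

$(A - \mu_1 B)(P_W \xi_1 - f_2) = (A - \mu_1 B)\xi_1 - (A - \mu_1 B) f
 = (\lambda_1 - \mu_1) B\xi_1 - (A - \mu_1 B)\left((I_{2n} - P_W)\xi_1 + f_2\right).$

Taking the norms gives

$\|(A - \mu_1 B)(P_W \xi_1 - f_2)\| \le |\lambda_1 - \mu_1|\,\|B\| + \|A - \mu_1 B\| \sin\theta_1 + \dfrac{|\lambda_1 - \mu_1|\,\|A - \mu_1 B\| \cos\theta_1}{\sqrt{1 + |\lambda_1|^2}}.$

From Lemma 3.2, by the optimality property of $\tilde{z}$ we have

$\|(A - \mu_1 B)\tilde{z}\| \le \|(A - \mu_1 B)\tilde{w}\| \le \dfrac{\sqrt{1 + |\lambda_1|^2}\,\left(|\lambda_1 - \mu_1|\,\|B\| + \|A - \mu_1 B\| \sin\theta_1\right)}{\sqrt{1 + |\mu_1|^2}\, \cos\theta_1} + \dfrac{|\lambda_1 - \mu_1|\,\|A - \mu_1 B\|}{\sqrt{1 + |\mu_1|^2}}.$

Since $\|(A - \mu_1 B)\tilde{z}\|$ is a residual norm, it is direct from Theorem 2.3 that

$\sin\angle(\xi_1, \tilde{z}) \le \dfrac{\|(A - \mu_1 B)\tilde{z}\|}{\mathrm{sep}(\mu_1, (L, N))}.$

Therefore, it holds that

$\sin\angle(x_1, \tilde{z}_1) \le \sqrt{1 + |\mu_1|^2}\, \sin\angle(\xi_1, \tilde{z})
 \le \dfrac{\sqrt{1 + |\lambda_1|^2}\,\left(|\lambda_1 - \mu_1|\,\|B\| + \|A - \mu_1 B\| \sin\theta_1\right)}{\cos\theta_1\, \mathrm{sep}(\mu_1, (L, N))} + \dfrac{|\lambda_1 - \mu_1|\,\|A - \mu_1 B\|}{\mathrm{sep}(\mu_1, (L, N))},$

which proves the assertion. □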



Since $\mu_1$ is shown, as Corollary 2.2 indicates, to converge to $\lambda_1$ as $\theta_1 \to 0$,
we have $\mathrm{sep}(\mu_1, (L, N)) \to \mathrm{sep}(\lambda_1, (L, N))$, a positive constant independent of
$\theta_1$, provided that $\lambda_1$ is a simple eigenvalue of $(M, D, K)$. So the refined q-Ritz
vector $\tilde{z}_1$ converges to $x_1$ once $\sin\theta_1 \to 0$.

We continue Example 2.6 to show considerable merits of refined q-Ritz vectors.
For the case that $x_1$ lies in $\mathrm{span}\{Q\}$ exactly, recall that $\mu_1 = \lambda_1$ exactly. It is
easy to verify that the smallest singular value of the matrix
$(\mu_1^2 M + \mu_1 D + K)Q$ is both exactly zero and simple, the optimal solution is
$\hat{z}_1 = [1, 0]^T$ in (3.1), and the refined q-Ritz vector $\tilde{z}_1 = Q\hat{z}_1 = x_1$, exactly the
desired eigenvector! So in contrast to the q-Ritz vector, the refinement can pick
up the desired eigenvector perfectly.
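
This claim is easy to reproduce. The following is a minimal sketch, assuming NumPy is available, with the matrices of Example 2.6 and the unperturbed basis $Q$ entered by hand; the variable names are ours.

```python
import numpy as np

M = np.array([[1.0, 1, 0], [1, 2, 1], [0, 1, 2]])
D = np.array([[-5.5, -5, 0], [-5, -11, -3], [0, -3, -4]])
K = np.array([[6.0, 6, 0], [6, 9, 2], [0, 2, 2]])
s = np.sqrt(73.0)
Q = np.array([[0, 8 / s], [0, -3 / s], [1, 0]])
x1 = np.array([0.0, 0.0, 1.0])
mu1 = 1.0                                      # q-Ritz value, equal to lambda_1 here

S = (mu1**2 * M + mu1 * D + K) @ Q             # 3 x 2 matrix in (3.1)
_, sigma, Vh = np.linalg.svd(S, full_matrices=False)
z_hat = Vh[-1].conj()                          # right singular vector for sigma_min
z_ref = Q @ z_hat                              # refined q-Ritz vector

# Sine of the angle between x1 and the refined vector (sign of z_ref is arbitrary).
sin_err = np.linalg.norm(z_ref - x1 * (x1 @ z_ref))
```

The singular values of $(\mu_1^2 M + \mu_1 D + K)Q$ are 2 and 0, so the minimizer is unique up to sign and the refined vector coincides with $\pm x_1$, whereas the q-Ritz vector is not determined at all in this case.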

For the case that $\mathrm{span}\{Q\}$ is perturbed in the way described in Example 2.6,
the optimal solution in (3.1) is

$\hat{z}_1 = [1.000000000000000, 0.000000000006175]^T$

and the refined q-Ritz vector

$\tilde{z}_1 = [0.000000000004708, -0.000000000003593, 1.000000000000000]^T.$

So

$\sin\angle(x_1, \tilde{z}_1) = 5.9 \times 10^{-12},$

which is of the same order as $\sin\theta_1 \approx 1.7 \times 10^{-12}$ and much more accurate than the
corresponding q-Ritz vector $\tilde{x}_1$. Meanwhile, the computed residual norm of the
refined approximate eigenpair $(\mu_1, \tilde{z}_1)$ is

$\|(\mu_1^2 M + \mu_1 D + K)\tilde{z}_1\| = 1.3 \times 10^{-13},$

eleven orders smaller than that of the q-Ritz pair.

4. Conclusions 

Theoretically, we have proved that there exists a q-Ritz value of $(M, D, K)$ that
unconditionally converges to the desired eigenvalue when the angle between the
subspace $\mathrm{span}\{Q\}$ and the desired eigenvector tends to zero. However, the
associated q-Ritz vector only converges conditionally. To remedy this, we have
proposed the refined q-Ritz vector, which is guaranteed to converge
unconditionally. We have presented an example to illustrate our theory.

The purpose of this paper is not to present efficient and reliable eigensolvers for
QEPs, but rather to establish a general convergence theory of the q-Rayleigh-Ritz
method and to show the unconditional convergence of q-Ritz values and refined
q-Ritz vectors and the conditional convergence of q-Ritz vectors. Refined q-Ritz
vectors may become a valuable component of, and bring substantial improvement to,
flexible eigensolvers for QEPs. Numerical experiments in [17] have shown that
one can gain considerably by replacing q-Ritz vectors with refined q-Ritz vectors
in second-order Arnoldi type methods and their implicitly restarted algorithms.